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(57) Abstract: A method and a system for building and optimiz- 
ing search expressions is presented. The method is based on the 
reception of a primary search phrase, a search through a database 
for related search phrases along with associated Boolean opera- 
tors, a presentation of these related search phrases along with 
the Boolean operators to a user, a reception of information re- 
garding which modifying phrases the user has selected and the 
generation of a completed search expression to be forwarded to 
a search engine. The method also includes steps for registering 
the frequency with which search phrases and associated Boolean 
operators are used, as well as steps for registering new search 
phrases not already present in the database. 
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Method and system for interactive building and optimization of search 

expressions 

The invention relates to a method for interactively generating a search expression in a 
5 system acting as a front end to a search engine, and a computer system device 
performing the method. 

The amount of information available through interactive (two-way) communication 
networks like the Internet is growing rapidly, and the task of identifying particular 
items of information is becoming increasingly more difficult. It has become common 

10 for on-line users to utilize search engines to search the Internet for desired 

information, and many web sites permit users to perform searches to identify a small 
number of relevant items among a much larger domain of items. As an example, 
several web sites permit users to search for particular web sites in indexes of known 
web sites, e.g. Fast™, Google™. Similarly, many on-line merchants, such as 

15 Amazon™, permit users to search for particular products among all of the products 
that can be purchased from the merchant. Other on-line services, such as PubMed 
(National Library of Medicine) and Britannica™, allow users to search for various 
articles and documents. 

In order to perform a search, a user submits a query containing one or more query 
20 terms. A query server program of the search engine processes the query to identify 
any items that match the terms of the query. The set of items identified by the query 
server program is referred to as a «query result», which is typically presented to the 
user as a set of hyper-textual listing (hyperlinks) of the located items. 

If the scope of the search is large, the result of a simple query may result in hundreds, 
25 thousands or even millions of hits. In order to find a single item or a small set of 
items, the user is therefore required to build more complex queries involving 
numerous words and phrases, linked by default or specified Boolean operators (AND 
/ NOT / OR). The quality of the query result therefore relies heavily on the user's 
knowledge in the field of interest, as well as his or her understanding of Boolean 
30 logic. To users searching for an unfamiliar topic, or users unfamiliar with Boolean 
logic, searching the Internet is therefore a time-consuming and often futile process. 

Several search engines, such as Yahoo™, Directiffit™, and AltaVista™ attempt to 
assist the query building process by suggesting related query terms to the user as a 
part of a «search refinement* process. This allows the user to further refine the query 
35 and narrow the query result by selecting one or more related query terms that more 
accurately reflect the user's intended request. 
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There are several ways to generate a database of related query terms that can assist 
the user in the building of a more comprehensive query expression: 

1 . Related query terms may be generated by the search engine, using the contents 
of the query result, such as by identifying the most frequently used terms within the 

5 located documents. 

2. WO 99/66427 by Bowman et al. describes how query term correlation data 
may be extracted from the search queries submitted by a plurality of users. 

3. Related query terms may be copied or extracted from existing dictionaries, 
thesauruses, indexes of textbooks, and other written material. 

10 4. Lists of related query terms may be generated by individuals or committees, 
with experience in particular areas, e.g. a specific scientific discipline. 

Irrespectively of how it is generated, the information in the database of related query 
terms may be presented to the user in a manner that facilitates the building of more 
comprehensive search expressions. For example, a user submits a query on the term 

15 «CELL», which returns a query result of several thousand items. In order to assist the 
user in the refinement process, query terms related to «CELL» are looked up in the 
database and presented the user (e.g. the terms <<ANIMAL>>, «FUEL», and 
«PHONE»). The user is then invited to limit the query result, e.g. through an interface 
of hyperlinks, check boxes, or drop down menus, and the selected terms are added to 

20 the query, which returns a more restricted query result. 

But although often helpfiil, the existing technology for assisting users in refining 
search queries have major limitations, particularly related to the use of Boolean logic. 
Most importantly, none of the exciting systems can assist the user in which Boolean 
operators should be used in relation to specific query terms. The need for such 

25 guidance in Boolean logic is particularly evident in relation to the use of the word 
«or», which has nearly opposite meanings in everyday language and as a Boolean 
operator. E.g. the phrase «You will get fish or meat» implies that you will get one or 
the other. The query «FISH OR MEAT», on the other hand, returns all the items that 
contain the word «FISH», as well as, all the items that contain the word «MEAT», 

30 including items that contain both the word «FISH» and the word «MEAT». As the 
use of Internet search engines has become an everyday activity, also to people 
untrained in Boolean logic, this confusion between the exclusive and the inclusive 
«or» has grown to become a major problem. The current method for dealing with the 
problem is limited to hyperlinks to separate «help-page», where the use of Boolean 

35 operators is explained in a traditional textbook-like manner. There is no direct 
relationship to the query refinement process, and there is no assistance on which 
Boolean operator should be related to which query term. 
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In relation to the «or-problem», the existing systems for assisting query refinement 
are solely designed to remove unwanted items from the primary query result. 
Information of interest to the user, not comprised by the primary query can therefore 
not be picked up by these procedures. E.g. a search refinement process that start with 
5 the term «PETROL» will not be able to identify pages containing the synonyms 

«FUEL» or «GASOLINE», which could be of equal interest to the user. A user must 
therefore repeat the search procedure for a number of synonyms, and this task is 
further complicated by the fact that the synonyms must be provided by a user, whom 
might be totally new to the subject in question. In order to overcome these 
10 limitations, the search refinement process must allow, and assist, the use of the «OR»- 
operator, and this problem is not tackled by any present technology. 

Similarly, the best method for limiting a query result to a convenient set of relevant 
items often involve combination of terms that should, and should not, occur in the 
document. This implies queries where different terms are related to either the «AND» 
15 or the «NOT»-operator. The use of these operators are generally more intuitive, and 
can easily be assigned to specific terms by putting + or - signs in front of them. 
However, there is no existing technology that can assist the user in the process of 
selecting which operator should go with which term, or that can suggest purposeful 
combinations of terms and operators. 

20 The present invention addresses these and other problems by providing a search 
refinement system and method for building query expressions comprising different 
Boolean operators from a database of related query terms, and a system and method 
for generating such a database. The invention relates to a database of primary query 
terms correlated to a set of related query terms with designated Boolean operators. 

25 On submitting a primary query term, the user is presented the set of related query 
terms in a context that indicates and clarifies their assigned Boolean operator. This 
interface allows the user to select the combinations of terms and operators which 
should be included in the final query expression, and my also allow the user to switch 
between operators, or add additional terms with assigned operators. 

30 The database of related query terms may be generated by one of or by different 

combinations of the methods listed above. Uniquely to the present invention, related 
terms are assigned specific Boolean operators, reflecting their most appropriate 
function related to the primary query term. 

In accordance with one aspect of the invention, each related query term has a counter 
35 for each of the three Boolean operators (AND / NOT / OR), and in one embodiment, 
these counters are updated automatically from the user constructed query expressions 
submitted to the search engine. Thus, the entire community of users can function as a 
committee, which by the building of their own query expressions «vote» for the most 
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appropriate combinations of terms and operators, and thereby share their own 
judgment and knowledge with the other users. Similarly, new combinations of related 
terms and operators contributed by the user during the query building process would 
contribute to the expansion of the database. In a preferred embodiment, the building 
5 of the database is accomplished through a combination of automatic updating of 
counters and an editorial quality control of terms that are new to the database. 

A dynamic user interface that communicates relationship between query terms and 
their assigned Boolean operator has a key function in the present invention. In 
accordance with one aspect of the invention, this involves interactive screen images 

10 that combine textual and graphic information, preferably composed from standard 

computer protocol, such as HTML, Java, or ASP. In one preferred embodiment, the 
user builds query expressions by marking checkboxes related to specific combinations 
of terms and operators. In one preferred feature of the interface involves a two-step 
procedure by which the user first is invited to expand the range of the query by 

15 selecting synonym terms with the OR operator, and second to limit (focus) the query 
by selecting terms with the AND or NOT operator. This will avoid the previously 
described problem of the «or-problem». To further assist the user, particularly useful 
combinations of terms and operator may be checked as default. 

In another preferred embodiment of the invention, the related query terms are 
20 presented to the user as hyperlinks to pages where the specific related term represents 
the primary term with a corresponding set of related terms and assigned Boolean 
operators. This feature links the terms in the database together in a web of related 
terms in which the user can navigate to find the most appropriate primary term. 

The particular features of the invention are given in the attached independent claims, 
25 with the dependent claims describing additional features and embodiments. 

The invention will now be described in the form of an exemplary embodiment, with 
reference to the attached drawings. 

Fig. 1 Shows a block diagram of a possible embodiment of a computer system 
operating in accordance with the invention. 

30 Fig. 2 Shows the structure of the different tables in the search phrase database 
according to an embodiment of the invention. 

Fig. 3. Shows an example illustrating entries in the search phrase database. 

Fig. 4 Shows a flow chart illustrating an embodiment of the method according to 
the invention. 



35 



Fig. 5. 



Shows a suggested user interface for a user entering a search phrase. 
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Fig. 6 Shows a suggested user interface where a user is invited to select modifying 
phrases and Boolean operators. 

Figure 1 shows a block diagram illustrating a computer system 10 operating in 
accordance with the invention. The system comprises at least one processor unit 1 1 
5 capable of performing searches in a search phrase database 12. The data stored in the 
search phrase database 12 is accessible to the processor through input/output means 
13. The processor unit 1 1 preferably operates in accordance with instructions in the 
form of computer program software stored on a local storage device 14, such as a 
hard drive. The processor unit 1 1 also has access to system memory 15 for temporary 

10 loading and unloading of instructions and data. Finally the system 10 comprises 

input/output means 16 for receiving search phrases from a user computer 20 and for 
sending a presentation of suggested modifying phrases and Boolean operators to the 
. user computer 20. The system 10 also forwards completed search expressions to a 
search engine 30 over I/O-means 16, receives search results, and forwards these 

15 search results to the user computer 20. The communication between the computer 
system 10, the user computer 20 and the search engine 30 will normally take place 
over a public communication network 35 such as the Internet. 

In a preferred embodiment the computer system 10 also comprises an additional 
database 17. This database is used for storing search phrases that are not entered into 
20 the main database 12 when such phrases are entered by a user, and to register the 

number of times these phrases occur. The data in the additional database 17 may then 
be used in order to refine the contents of the main database J 2, either as a result of 
statistical analysis of the data collected in the additional database 17 over time, or 
based on evaluation of the data performed by humans. 

25 A person skilled in the art will understand that the system 10 may be constructed in a 
number of different ways. It is possible to include all the units described in one single 
computer, or they may be distributed among several different computers. As an 
example, the two databases 12, 17 may be installed on separate computers. If this is 
the case, the two I/O-units 13, 16 may be one and the same. If, on the other hand, the 

30 databases 12, 17 are installed on the local storage unit 14, the I/O-means 16 will be 
part of the internal structure of the computer including the processor 1 1 and 
communication between the different units will mainly take place on this single 
systems system bus. Alternatively, the system may consist of several processors 11 in 
on or several computers, such as a multiprocessor computer or a cluster. Any such 

35 particular configuration is available to a skilled man and it will be a question of design 
choice which configuration is found preferable when building a system according to 
the invention. 
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Turning now to figure 2, the structure of records and tables in a search phrase 
database according to an embodiment of the invention is illustrated. The database 
comprises at least two different tables, the phrase table and the phrase relation table. 
Preferably there is also a third table called the category table. Preferably, the phrase 
5 table contains all phrases in the database. Alternatively this table could contain only 
main phrases, i.e. phrases that are listed as primary search phrases, but for 
convenience and speed it has been fount preferable to include all phrases in this table. 

In the phrase table, every phrase is entered with one record for every category it 
appears in. For every relation between a main phrase and a modifying phrase, there is 
10 a record in the phrase relation table. The phrase relation table also includes the 
counters that indicate how often a phrase is used as a modifying phrase. 

The different fields of the records will now be described, starting with the phrase 
table. 

The first field is phrase id which is a key identifying the individual records of the 
15 phrase table, category Jd is a field that refers to a record in the category table. This 
indicates a field of search, such as a field within science, and will be used to make the 
suggested modifying phrases more targeted. This is explained in further detail below. 
It should be noted that while present in a preferred embodiment of the invention, the 
category table, and hence the category Jd field of the records in the phrase table, may 
20 be omitted. 

The following field is the phrase which is a character string containing the text of the 
actual phrase. As already mentioned, according to a preferred embodiment all 
phrases, whether they are primary phrases or only modifying phrases, are listed in the 
phrase table. In order to immediately identify a phrase as either a primary phrase or a 

25 modifying phrase, there is preferably a field called main j>hrase in the records of the 
phrase table, and this field simply indicates whether this phrase is only a modifying 
phrase, or also a main phrase. E.g. if the phrase «cell» is present both as a main 
phrase, or primary phrase, but also as a modifying phrase for the phrase «phone», this 
field will have the value YES. On the other hand, if the phrase «fast» has no entry as a 

30 primary phrase, only as a modifying phrase, e.g. for the primary phrase «food», the 
value of this field will be NO. 

Finally, the field search count is a counter that is incremented once every time the 
phrase is actually included in a completed search expression. Alternatively it is only 
incremented every time the phrase is used as a primary phrase, since modifying 
35 phrases are counted in the phrase relation table, as described below. 

In a preferred embodiment, the database comprises a category table, which functions 
to narrow down the search by making it possible to distinguish between identical 
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words that have different meaning within different fields of knowledge. As an 
example, one category may be biology, another may be telecommunications, and the 
word cell may then have two separate entries in the phrase database, each referring to 
a different record in the category database. The category records only contain the 
5 fields category Jd and category name. The first is the key identifying the category 
record; the second is the text string containing the name of the category. 

The records of the phrase relation table will now be described. Every record in this 
table defines a relation between a main phrase and a related phrase (or primary phrase 
and modifying phrase). The first field is the prase relationjd, which is a key 

10 identifying the individual records of this table. Following this is the search j>hrasejd y 
and then the related jphrasejd. In the embodiment described above, where all 
phrases are present in the phrase table, whether they are main phrases or only 
modifying phrases, the search jphrasejd field refers to the phrase Jd of a phrase that 
is listed as main phrase in the phrase table. E.g. if the phrase «cell» is present as a 

15 main phrase and has modifying phrases related to it, there will be one record in the 
search relation table for each such related phrase. The field search jphrasejd will 
then refer to the record for the phrase «cell» in the phrase table in each such entry in 
the phrase relation table. The related jphrasejd refers to a phrase modifying the 
search phrase. As an example, if the search j?hrase id refers to the phrase «celb>, the 

20 related jphrasejd may refer to the record for the phrase «cancer» in the phrase table. 

Normally a search phrase has several phrases listed as modifying phrases, and as 
already explained there will be one entry in the phrase relation table for each such 
modifying phrase. Using the example above, the phrase «celb> may have the phrases 
«cancer» and «phone» as modifying phrases. In this case there will be two entries in 
25 the phrase relation table. Both will refer to the entry for «celb> in the phrase table in 
their search jphrase jd fields, but in their related jphrase jd fields they will refer to 
the entries in the phrase table for the phrases «cancer» and «phone» respectively. 

This example does not take category into account. If the database does include a 
category table, however, a given phrase will be entered once in the phrase table for 

30 each category in which it occurs. In this is the case, one record for the phrase «cell» 
in the phrase relation table will in its search jphrasejd field reference the phrase 
«celb> of the category ^telecommunication)) and in its related jphrasejd Afield it will 
reference the phrase «phone» also of the category ((telecommunications)). Another 
record in the phrase relation table will in its search jphrasejd field reference the 

35 phrase «cell» of the category «biology» and in its related j)hrase Jd field it will 
reference the phrase «cancer» also of the category «biology)>. 

In general it may be stated that each record in the phrase relation table will refer to 
one and only one search phrase to be modified (in the search jphrasejd field) and 
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one and only one modifying phrase (in teh related jhraseid field), but that each 
record of the phrase table may be referred to by any number of records in the phrase 
relation table, both as main search phrase and as modifying search phrase. 

Following these are three fields called counter _plus y counter minus, and counter j>r. 
5 These counters are incremented every time the modifying phrase is used with the 
Boolean operators AND, NOT and OR, respectively. Alternatively there is only one 
counter indicating how many times the modifying phrase has been used, but not 
taking into account with which Boolean operator it has been used. 

Finally there is a field called basic. This field indicates whether the phrase should be 
10 used as a modifying phrase with the given primary phrase if the user has selected a 
basic search rather than an interactive search. This will be described in detail below. 

Figure 3 shows an additional example of entries into the database. Four records of the 
phrase table are shown, the entries for the phrases «inbreeding», «interbreeding», 
«product» and «products». These all have unique phrase id and they all belong to the 

15 same category. Only «inbreeding» can be a main phrase, the other phrases can only 
occur as modifying phrases, as indicated by the values of the main jphrase field. The 
phrase relation table defines the relationship between these phrases. There is one 
entry in the phrase relation table for each relationship. In each such entry the 
phrase Jd for the search phrase «inbreeding» is identified. Further, the phrase Jd for 

20 the respective modifying phrase is identified in the field related _phrasejd. In this 

way the record defines a link between a main phrase and a modifying phrase. The field 
relatedjphrase contains the text string for the relevant modifying phrase and 
duplicates the text string in the phrase field of the record in the phrase table 
referenced by the related jphrase id. The counter fields show that the phrase 

25 «interbreeding» has been used 1000 times with the Boolean operator AND, while 

«product» and «products» have been used 1000 times each with the Boolean operator 
NOT in search expressions with «inbreeding» as the primary phrase. None of the 
modifying phrases have been used with any other Boolean operators. 

The field basic indicates that if the user chooses to perform a basic search with 
30 «inbreeding» as the primary phrase, «interbreeding» should be included in the search 
expression, while «product» and «products» should not. Alternatively this field could 
also indicate which Boolean operator the phrase should be included with, or that it 
should not be included at all, e.g. by using the character +, -, 1 and 0 for AND, NOT, 
OR and 'not included', respectively. 

35 The additional database 17 has the same structure as the search database 12. Its 
function will be explained below. 
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Reference is now made to figure 4, where a flow chart illustrates the preferred 
embodiment of the invention. In a first step 101a primary input page generated by the 
system 10 is presented on a user's computer. An embodiment of this primary input 
page is illustrated in figure 4, which will be described below. The user inputs 102 his 
5 primary search phrase, and possible additional phrases. The primary search phrase is 
then sent 105 to the search phrase database 12 and a search 106 is made for a record 
in the phrase table where the field phrase, and in a preferred embodiment also the 
category referenced by the field category Jd, corresponds to the input from the user. 
If the phrase is not found, or if the field main jphrase indicates that this phrase is only 

10 listed as a modifying phrase in the database, no modifying phrases are returned 107 
from the database, only the original primary phrase. Otherwise, a search is made 
through the phrase relation table for records referencing the primary phrase in their 
related jphrase field. All phrases that are found this way are returned 107 along with 
the primary phrase. In a preferred embodiment the phrases are returned with a 

15 suggested Boolean operator. The suggested Boolean operator will be the operator 

with the highest number of occurrences according to the counter fields. According to 
a preferred embodiment of the invention two operators may be suggested, one being 
OR, the other being AND or NOT. This is for purposes of generating a user friendly 
interface, as described below with reference to figure 5. 

20 The phrases returned 107 from the database are combined with the additional user 
phrases 104 in a following step 108. Based on this a second input page is generated 
and presented to the user 1 10. An embodiment of this input page is illustrated in 
figure 5. After the user has selected which modifying phrases he or she whishes to 
include in the search expression and with which Boolean operators, a completed 

25 search expression is generated 1 12 and sent to the search engine 30. The search 

engine performs the search 113, the results are returned 1 14 to the computer system 
10, and a result presentation is generated and sent 1 15 to the user computer 20, 
preferably as HTML. 

The completed search expression is also used to update the database 12 and the 
30 additional database 17 in the following manner. In a first step 1 16 the search database 
is updated by incrementing the above described counters as follows. For the primary 
phrase, and optionally for all the modifying phrases, the field search count is 
incremented by one. For the modifying phrases, the counter associated with the 
Boolean operator chosen in the completed search expression is incremented by one. 

35 If the completed search expression included primary and/or secondary phrases that 
were not originally found in the search phrase database 12, these are sent 1 17 to the 
additional database. This database is updated 1 18 in a following step. For any phrase 
not already present in this database, a new record is created in the phrase table. If the 
phrase was used as a modifying phrase and there is no record already present in the 
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phrase relation table, a record is created in this table, the search jphrasejd is set to 
reference the same phrase in the phrase table, and the related jphrase jd is set to 
reference the primary phrase of the completed search expression. Then the counters 
are updated in the same way as for the search database 12. 

5 With regular intervals, the data accumulated in the additional database 17 will be 
analyzed 1 19 for possible inclusion in the search phrase database 12. This may be 
done automatically by statistical analysis, or manually e.g. by an editorial board. As a 
result, the search phrase database 12 will be updated to include search phrases an 
phrase relations actually used by users, making sure the suggestions made by the 
10 system according to the invention take advantage of the experience of its users. 

In a preferred embodiment of the invention the user may, when presented with the 
second input page, choose to use one of the suggested modifying phrases as primary 
phrase if the phrase is listed in the database as a main phrase. Preferably this will be 
indicated by presenting the phrase as a hyperlink. All the user has to do then is to 
15 click on the phrase, and a new search will be initiated 120 with the relevant phrase as 
primary phrase. 

If, when presented with the second input page, the user chooses to change the 
category, a new search will be initiated 121 with the same primary phrase, but with a 
different category. 

20 Figure 5 illustrates a suggested user interface of an input page 50 that will be 

displayed on the user computer 20 and that invites the user to enter a primary search 
phrase 51 and possible additional phrases 52, choose a category 53 and choose either 
basic search 54 or interactive search 55. If the user chooses interactive search, the 
primary phrase, along with any additional phrases and the category, are sent to the 

25 , front end computer system 1 0. There the method discussed above will be performed 
and a second input page will be generated and returned to the user computer 20. If 
the user chooses basic search, no second input window will be generated. The 
modifying phrases indicated as basic, as described above, will be included in the 
search expression with their default Boolean operators. The input page 50 will 

30 preferably be displayed as a HTML page in a web browser ru nnin g on the user 
computer 20. 

Figure 6 illustrates a suggested user interface 60 that will be generated by the 
computer system 10 and sent to the user computer 20, inviting the user to refine his 
or her search. In a first field 61, phrases that will expand the search, e.g. phrases with 
35 OR as their suggested Boolean operator, will be presented. In a preferred 

embodiment, the phrases that are included in basic searches are presented as already 
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selected 62, while other phrases are presented but not selected 63. The user is of 
course free to select or deselect any of the phrases. 

In a second field 64, phrases that narrow the search will be presented. These are 
phrases that have AND or NOT as their suggested Boolean operator. The phrases will 
5 preferably be presented with either AND (+) 65 or NOT (-) 66 already selected, based 
on the highest counter value in the fields counter j?lus and counter jninus described 
above. Alternatively only the phrases included in basic searches are preselected. It 
would still be possible to indicate which choice is suggested, e.g. by highlighting the 
alternative with the highest counter value 67. Phrases that also occur as main phrases 
10 are preferably presented as hyperlinks 68. The user can then initiate a new search 

within the same category by clicking on the hyperlinked word 68. The user may also 
change the category 69. 

If the user clicks on a hyperlinked word 68, only the relevant phrase and the category 
need be returned to the computer system 10. Likewise, if the user changes category 

15 69, the new category will be returned along with the original primary phrase. Both 
these alternatives initiates a new look up in the search phrase database 12 and the 
generation of a new second input page. However, if the user submits the contents of 
the entire input page to the computer system 10 where a search expression is 
generated based on the selections made. The input window preferably includes fields 

20 70, 71 where the user may include additional expanding 70 and limiting 71 phrases. 
The input window also includes a button 72 or some other means for the user to 
select in order to submit the selections to the computer system 10. 

The selections illustrated in figure 6, with «bacteriab> selected as an expanding phrase, 
«cytoplasma» entered as an additional expanding phrase, «cancer» and «brain» 
25 selected as limiting phrases with the Boolean operator AND, «junk food» selected as 
a limiting phrase with the Boolean operator NOT, and «human» entered as an 
additional limiting phrase with the Boolean operator AND, will result in the following 
completed expression being generated by the computer system 10: 

(cell OR bacterial OR cytoplasma) AND cancer AND brain AND human NOT junk 
30 food 

The invention has been described by way of an example where phrases and phrase 
relations are ordered in separate tables of a database. However, it must be understood 
that within the scope of the invention it would be possible to create a system with a 
different organization of data while still operating in accordance with the invented 
35 method. The example must therefore be understood not in a limiting sense, whereas 
the scope of the invention is defined by the attached claims. 
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CLAIMS 

1 . Method for interactively generating a search expression in a computer system 
acting as a front end to a search engine, 

characterized in the steps of 
5 - receiving a primary search phrase and optional secondary search phrases as input, 

- looking up in a search phrase database any modifying phrase listed as related to the 
primary search phrase, 

- returning at least some such modifying phrases along with the primary search phrase 
and combining them with any secondary search phrases, 

10 - generating a first output message including all such modifying phrases, along with 
the primary search phrase and any secondary search phrases, 

- transmitting said message including a request for a second input message indicating 
which modifying phrases should be included in a completed search expression, 

- receiving information indicating which modifying phrases have been selected for 
15 inclusion in the completed search expression; and 

- sending the completed search expression to the search engine. 

2. Method according to claim 1, 

characterized in that said modifying phrases are included in the first 
output message along with suggested Boolean operators and that any selected 
20 modifying phrase is included in the completed search expression with the suggested 
Boolean operator. 

3. Method according to claim 2, 

characterized in that the inclusion of suggested Boolean operators in 
the first output message also includes alternative Boolean operators that may be 
25 substituted for the suggested Boolean operators in the completed search expression. 

4. Method according to claim 1, 

characterized in that the completed search expression is returned to the 
search phrase database where respective counters are incremented for the primary 
search phrase and all modifying phrases, to the extent said phrases are included in the 
30 database. 

5. Method according to claim 3, 

characterized in that the completed search expression is returned to the 
search phrase database where respective counters are incremented for the primary 
search phrase and all modifying phrases in combination with the Boolean operator 
35 chosen, to the extent said phrases are included in the database. 
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6. Method according to claim J , 

characterized in that any phrase that is part of the completed search 
expression and not included in the search phrase database is sent to a second database 
where 

- if not already present, a record is created for said phrase, and 

- a counter is incremented for the phrase, 

7. Method according to claim 6, 

characterized in that the record created in the second database 
indicates whether said phrase was a primary phrase or a modifying phrase, and in the 
case that the phrase was a modifying phrase said record also indicates the primary 
phrase with which it is related, 

8. Method according to one of the previous claims, 

characterized in that all primary phrases in the search phrase database 
belong to a category, and that any occurrence of the same phrase in more than one 
category are treated as individual phrases, and that any primary search phrase 
received from a user also indicates one such category. 

9. Computer system for, based on user input and user feedback, generating 
search expressions to be forwarded to a search engine, 

characterized in that the computer system (1 0) comprises a processor 
unit (11), a first database (12), a local storage unit (14) and input/output means (13, 
16), said first database (12) containing a list of search phrases and a list of 
relationships between main search phrases and modifying search phrases, and said 
local storage unit (14) containing a set of instructions that enables the processor unit 
01) to 

- receive, over the input/output means (16), first input message in the form of a 
primary search phrase and optional secondary search phrases, 

- look up in the search phrase database (12) any modifying phrase listed as related to 
the primary search phrase, 

- generate a first output message including at least some such modifying phrases 
along with the primary search phrase and any secondary search phrases, 

- transmitting, over the input/output means (16), said message including a request for 
a second input message indicating which modifying phrases should be included in a 
completed search expression, 

- receiving, over the input/output means (16), said second input message indicating 
which modifying phrases have been selected for inclusion in the completed search 
expression, 

- generating a second output message with a completed search expression including 
said main search phrase, said secondary search phrases and said modifying search 
phrases, and 
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- sending, over the input/output means (16), the message containing the completed 
search expression to the search engine. 

1 0, Computer system according to claim 9, 

characterized in that the list in said first database containing 
5 relationships between main search phrases and modifying search phrases includes 
information on a Boolean operator associated with the modifying search phrase and 
that the modifying search phrases are accompanied with their associated Boolean 
operator when they are included in said first output message. 

1 1 . Computer system according to claim 1 0, 

10 characterized in that instructions stored on the local storage unit (14) 
enable the processor unit () 1) to generate said first output message in the form of a 
HTML document, that, when displayed on a user computer (20) will display the 
modifying search phrases along with said associated Boolean operators as well as 
alternative Boolean operators. 

15 12. Computer system according to claim 9, 

characterized in that said list of relationships between main search 
phrases and modifying search phrases includes counter variables, and that instructions 
stored on said local storage unit (14) enable the processor unit to examine the 
completed search expression and increment counters for each modifying search phrase 

20 included in the completed search expression as related to the main search phrase of 
the completed search expression. 

13 . Computer system according to claim 1 1, 

characterized in that there are one of said counter variables for each 
alternative Boolean operator, and that the counter updated is the counter associated 
25 with the Boolean operator with which the modifying search phrase was associated in 
the completed search expression. 

14. Computer system according to claim 9, 

characterized in that it includes an additional database (17) with the 
same format as the first database (12), and that the local storage unit (14) includes 
30 instructions that enable the processor unit (1 1) to identify any search phrase included 
in the completed search expression that is not included in the list of search phrases on 
the first database, to register any such phrase in the additional database (17), if not 
already registered there, and to increment a counter representing the number of times 
said phrase has been used in a completed search phrase. 

35 15. Computer system according to one of the claims 9 to 14, 

characterized in that said first database (12) comprises 

- a phrase table containing one record for every phrase in the database, said record 
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comprising a first field containing a phrase identifier uniquely identifying the phrase, a 
second field containing a text string representation of the phrase, and a third field 
containing an indication on whether said phrase is listed as a main phrase or only as a 
modifying phrase, and 

5 - a phrase relation table containing one record for every relation between a modifying 
phrase and a main phrase, said record comprising a first field containing a phrase 
relation identifier uniquely identifying the phrase relation, a second field containing a 
search phrase identifier referring to one record in the phrase table representing the 
phrase to be modified (main phrase), a third field containing a related phrase identifier 

10 referring to one record in the phrase table representing the modifying phrase, and at 
least one additional field representing a counter indicating how many times said 
relation has occurred in a completed search expression. 

16. Computer system according to claim 1 5, 

characterized in that said at least one additional field in the phra9e 
15 relation table is a fourth field representing a counter indicating how many times said 
relation has been connected by an AND operator, a fifth field representing a counter 
indicating how many times said relation has been connected by a NOT operator, and a 
sixth field representing a counter indicating how many times said relation has been 
connected by an OR. operator. 

20 17. Computer system according to claim 15 or 16, 

characterized in that said first database (12) includes a category table 
containing one record for each field of knowledge or theme to which a search phrase 
may belong, said record comprising a first field defining a category identifier uniquely 
identifying the category, and a second field containing a text string representing the 

25 category, and 

- that the records of the phrase table comprises a fourth field containing a category 
identifier referring to one record in the category table representing the category to 
which the phrase belongs. 

1 8. Computer program product comprising program instructions for causing a 
30 computer to perform the method of any of the claims 1-8. 

1 9. Computer program product according to claim 18, 
characterized in that it is embodied on a record medium. 

20. Computer program product according to claim 18, 
characterized in that it is stored in a computer memory. 

35 21. Computer program product according to claim 1 8, 

characterized in that it is carried on an electrical carrier signal. 
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